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BIOSENSOR MATERIALS AND METHODS 



Technical Field 

This invention relates to biosensor materials and 
5 methods. Disclosed are methods of generating microorganisms 

having utility in biosensing, tools which can be generally 
used in such methods, the microorganisms themselves, and 
biosensing methods employing such microorganisms. 

10 Backcrround Art 

It is frequently desirable to be able to detect small 
concentrations of analytes in samples, e.g. environmental 
samples. For instance, to allow more effective management 
of scarce environmental resources, more efficient and 

15 faster methods of assessing environmental pollution are 

required. At present, molecular-specific monitoring of 
effluent streams and -other environmental matrices requires 
extensive chemical manipulation of the sample followed by 
Gas Chromatography (GO and Mass Spectrometry (MS) 

20 analyses. Although these techniques are highly sensitive, . 

sample preparation is necessarily slow and expensive. 
Consequently, continuous on-site analysis of a variety of 
environmental matrices cannot be achieved using these 
methods at reasonable cost. 

25 An alternative method f or . the determination of phenols 

and chlorophenols has been proposed using a biosensor based 
around Rhodococcus sp . [see Riedel et al (1993) Appl 
Microbiol Biotechnol 38: 556-559]. In this method 
microorganisms are immobilised in an oxygen electrode, and 

30 oxygen uptake in response to added substrates was 

monitored. Although fairly simple and rapid, this method 
lacks robustness and is not sufficiently sensitive or 
specific for detecting particular environmental pollutants. 
It can thus be seen that the provision of novel 

3 5 materials and methods capable of being used in the field of 

biosensing would represent a step forward in the art. 
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Disclosure of Invention 



In a first aspect of the invention there is disclosed 
a method of detecting the presence or absence of an analyte 
5 in a sample comprising the steps of : 

(a) contacting the sample with a microorganism which 
expresses a binding agent capable of binding the analyte, 
wherein the binding of the agent to the. analyte causes a 
detectable signal, and; 
^0 (t)) observing said microorganism for said detectable 

signal; 

characterised in -that said microorganism has been 
transformed such as to improve the detectability of the 
signal . 

By ^^observing" is meant ascertaining by any means 
(directly or indirectly) the presence or absence or the 
selected signal, which is indicative of the binding event. 

By "improve" is meant, inter alia, altering the nature 
of the signal to one which can be observed more readily or 
20 increasing the intensity of the signal (thereby reducing 

the sensitivity of the means used to observe it) 

Thus by using a transformed microorganism, the 
limitations inherent in wild-type microorganisms such as 
those used in the prior art may be overcome. In particular 
25 more sensitive and robust monitoring methods than those 

based on natural oxygen uptake can be employed. Methods for 
generating such transf ormants are described in further 
detail below, -Such transformed microorganisms are 
hereinafter referred to as 'biosensors'. 

Preferably the analyte is an environmental pollutant, 
for instance such as may result from industrial or medical 
applications. Of particular interest is the detection of 
mono- and poly-aromatic, cyclic, heterocyclic and linear 
hydrocarbons such as, but not limited to, components of 
35 fuels, solvents, propellants, energetics and pesticides 

(such as may appear on United States EPA Priority 
Pollutants List and European Community Grey and Black 



2 



Lists) and naturally occurring degradation products of 
these compounds in industrial process media, vapours, 
effluents, raw water, rivers, ground waters and soils. 

Preferably the transformed microorganism is a mycolic 
acid bacterium. These bacteria form a supra generic group 
of Gram-positive, non- sporulat inc bacceria which is 
comprised of the genera Corvnebacner ium , Mvcobacterium . 
Nocardia , Rhodococcus , Gordona and Tsukamu r-^ 1 la . Members 
are metabolically diverse and capable of using as sole 
carbon source (a growth- inducing substrate) wide range of 
natural and xenobiotic compounds, including many key 
environmental ly- toxic and/ or industrially- important 

molecules. The mycolic acid bacneria exhibit "several 
structural and physiological features which appear co be 
specialisations for hydrocarbon degradation, these include 
a hydrophobic mycolic acid outer cell layer and associated 
production of extracellular mycolic acid-derived 
biosurf actants . Most preferably the bacterium is either 
Rhodococcus or Nocardia . 

The detectable signal may be a change in enzyme 
function(s), metabolic function(s) or gene expression. * 

Preferably however the signal is ascertained -in 
consequence to an increased expression of a signal protean 
from a signal gene, more preferably a heterologous signal 
gene. Many suitable signal proteins (which have a readily 
detectable activity) are known in the art e.g. 
Egalactosidase , which can generate a coloured substrate. 
Most preferably the activity of the signal protein, or the 
protein itself, can be estimated photometrically 
(especially by fluorimetry) . For instance green (and red) 
fluorescent protein, insect luciferase, and photobacterial 
lucif erase. Methods for introducing such genes into 
appropriate hosts are described in further detail below. 

Generally the bound agent /analyte complex will 
initiate expression of a signal gene which is operably 
linked to an inducible promoter. The identification of 



suitable promoters and/or coding sequences which are 
operably linked to them (including that of the binding 
protein) in mycolic acid bacteria, in order to modify said 
suitable promoters and/or coding sequences to introduce 
5 signal genes therein forms one part of the present 

invention . 

As used herein, "promoter" refers to a non-coding 
region of DNA involved in binding of RNA polymerase and 
other factors that initiate or modulate transcription from 
10 a coding region of DNA whereby an RNA transcript is 

produced . 

An "inducible" promoter requires specific signals in 
order for it to be turned on or off. 

The terms "operatively linked" and "operably linked" 

15 refer to the linkage of a promoter to an RNA-encoding DNA 

sequence, and especially to the ability of the promoter to 
induce production of RNA transcripts corresponding to the 
DNA sequence when the promoter or regulatory sequence is 
recognized by a suitable polymerase. The term means that 

20 linked DNA sequences (e.g., promoter (s), structural gene 

(e.g., reporter •gene.{s)),, terminator sequence (s), are 
operational or functional, i.e. work for their intended 
purposes . 

As is known to those skilled in the art, the transport 
25 and binding proteins (agents) required for the 

functionality of the inducible promoter, as well as the 
catabolic enzymes induced by it, will frequently form part 
the operon containing the promoter, and may thus be 
identified and isolated along side it using the methods 
30 disclosed above. These additional proteins are hereinafter 

referred to as "operon proteins" . 

Generally speaking, those skilled in the art are well 
able to construct vectors and design protocols for 
recombinant gene expression in common hosts such as 
35 col± . Suitable vectors can be chosen or constructed, 

containing appropriate regulatory sequences, including 
promoter sequences , terminator fragments , polyadenylation 
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sequences, enhancer sequences, marker genes and other 
sequences as appropriate. For further details see, for 
example. Molecular Cloning: a Laboratory Manual: 2nd 
edition, Sambrook et al, 1989, Cold Spring Harbor 
Laboratory Press. Many known techniques and protocols for 
manipulation of nucleic acid, for example in preparation of 
nucleic acid constructs, mutagenesis, sequencing, 
introduction of DNA into cells and gene expression, and 
analysis of proteins, are described in detail in Currant 
Protocols in Molecular Biology, Second Edition, Ausubel et 
al. eds., John Wiley Sc Sons, 1992. The disclosures of 
Sambrook et al . and Ausubel et al . are incorporated herein 
by reference. 

However, the present inventors have recognised chat 
certain methods previously employed in the art which were 
developed for enteric bacteria such as E. coli may not be 
the most appropriate for use in mycolic acid bacteria. The 
mycolic. acid layer and associated biosur f actants (which are 
a defining feature of these bacteria) and thick cell -wail 
confer great resistance to cell lysis protocols known in 
the art. Similarly, mycolic strains used in the invention 
may not be laboratory, type strains, and may thus exhibit 
very high levels of nuclease activity. Accordingly, 
advantageous methods have been developed by the inventors . 
The methods of identifying, modifying and employing novel 
inducible promoters and/or coding regions operably linked 
to them which are appropriate to mycolic acid bacteria are 
detailed below. 

Thus in a second aspect of the invention there is 
disclosed a method for identifying DNA encoding an 
inducible promoter which is induced in response to a 
specific analyte and/or identifying DNA encoding associated 
operon proteins comprising the steps of: 

(a) culturing a source of mycolic acid bacteria in a 
selective medium containing said specific analyte and being 
selective for oligo trophic bacteria , 

(b) identifying bacteria capable of subsisting on said 
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medium, 

(c) extracting DNA from said bacteria 

(d) incorporating said DNA into vectors 

(e) cloning said vectors into a suitable host cells 

5 (f) screening the host cells for said inducible promoter 

and/or proteins in order to identify vectors encoding it. 

By "screening" is meant subjected to analysis in order 
to determine the presence or absence of a particular 
defined property or constituent. Methods of screening are 
10 discussed in more detail below. 

As is known to those skilled in the art "oligot rophic 
bacteria" are bacteria which exhibit a preference for, and 
persistent slow growth on, low levels of carbon sources. 
These bacteria are adapted to and predominate in 
15 carbon-poor environments (predominantly aquatic habitats 

where carbon is limiting to levels) , These bacteria are 
rarely capable of the very rapid growth as exemplified by 
the enteric bacterium E. coli . but are by contrast, 
extremely persistent and metabolically versatile. 

Preferably the m.edium used in the second aspect is a 
defined minimal medium called hereinafter ^MMRN' which has 
been developed by the present inventors to screen for 
oligotrophic mycolic acid-containing bacteria likely to 
form the basis of the biosensor. This medium preparation 
a derivative of von der Osten et al.(l989) but for 
mycolic acid bacteria sodium citrate and biotin have been 
shown, to be unnecessary. Most importantly, the level of 
carbon supplement is reduced to oligotrophic levels (<500 
fiM, more preferably <100 fiM) . Experiments show that MMRN 
30 facilitates simple, selective enrichment for oligotrophic, 

mycolic acid-containing bacteria as well as providing the 
basis for testing and characterisation of gene induction. 
The medium forms a third aspect of the present invention. 

DNA may be extracted from the bacteria by any methods 
35 known in the art. However, the present inventors have 

demonstrated that DNA isolation from mycolic acid soil 
bacteria using standard techniques is inefficient. 
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Accordingly, several optimised methods of generating total 
DNA from mycolic bacteria have been developed, as described 
in more detail below (Examples 3 and 4) . These involve 
bacterial culture in MMRN supplemented with L-glycine, 
oligotrophic levels of carbon source (80 uM) and removal of 
biosurf actant s by washing in a non- ionic detergent (e.g. 
Tween 80) prior to a modified alkaline lysis technique. The 
concept of using a non-ionic detergent ac between 0.05 - 
0.5 % (preferably 0.1%) in order to facilitate DNA 
extraction is central to the novel, optimised methods. 

"Vector", unless further specified, is defined to 
include, inter alia, any plasmid DNA, iysccenic phage DNA 
and/or transposon DNA, in double or single stranded linear 
or circular form which may or may not be self transmissible 
or mobilizable, and which can transform a prokaryotic or 
eukaryotic host either by integration into the cellular 
genome or exist extrachromosomally (e.g. autonomous 
replicating plasmid with an origin of replication) . 

Preferably the host used is E . col i . iMore preferably 
it is ■ an E. coli strain carrying one or more of the 
mcrABC mrr hsdSRM rec A and rec Q mutations, since this is 
believed to enhance clone recovery when using DNA derived 
from mycolic acid bacteria. Gene libraries may be readily 
maintained in these strains . 

Preferably the vector used with E. coli further 
incorporates the cos element (which is well known to those 
skilled in the art) . Because of their capacity and 
selection for large DNA inserts and efficient transfection 
rates, cosmid cloning vectors facilitate rapid gene library 
construction. 

Preferably bacteria are further screened, for instance 
after stage (b) , to ensure an absence of catabolic 
repression- Catabolite repression could seriously 
compromise the activity of a biosensor since the presence 
of a more efficient carbon source (such as glucose, 
succinate or acetate etc.) would lead to repression of 
hydrocarbon catabolic pathways which forms the basis of the 
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sensor. To identify strains lacking catabolic repression, 
the concentrations of an enzyme known to be associated with 
the catabolic pathway of interest (e.g. catechol 

2, 3-dioxygenase, which is associated with toluene 
catabolism) is assessed in (a) selective medium 
supplemented with the specific analyte, (b) selective 
medium supplemented with the specific analyte plus a high 
efficiency carbon source such as glucose (1 mM) and (c) 
selective medium supplemented with glucose (1 mM) alone. 
Enzyme activities should be very low or undetectable in the 
absence of analyte. In the presence of analyte, and glucose 
plus analyte, the activities should be, within experimental 
error, very similar. 

The present inventors have established that indicates 
15 that catabolic genes in mycolic acid bacteria exhibit poor 

DNA sequence conservation with analogous enzyme genes in 
Gram negative bacteria. As a result, "reverse genetic" 
approaches to isolation of novel catabolic pathways are 
likely to be of limited use when using such published 
20 sequence data. 

Thus in one embodiment of the second aspect, the host 
cells are screened for the inducible promoter and/or operon 
proteins by screening the cells using one or more probes 
based on the sequence of other promoters and/or operon 
proteins employed by mycolic acid bacteria in catabolic 
enzyme production. One example of a source of suitable 
sequences is the promoter operator region of the R. 
corallina mac (monoaromat ic catabolic • operon) the sequence 
of which has been made available by the present inventors 
30 for the first time. This is described in more detail below, 

and in Example 9. Thus an inducible promoter and/or operon 
proteins may be identified by providing a nucleic acid 
molecule having a nucleotide sequence identical to, 
complementary to, or specifically hybridisable with, the 
corresponding part of a known, appropriate, mycolic acid 
bacterial sequence, such as the sequence shown in Fig. 4. 
Preferably parts of the sequence are used as probes, 
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preferably of at least 100 nucleotides {but shorter 
sequences may be employed under high stringency 
conditions) . The use of primers based on the sequence to 
screen and identify target sequences by PGR is also 
envisaged . 

The identified putative inducible promoter can then be 
tested to see if it is operational as described in more 
detail below. Briefly, the putative promoter is provided in 
a vector upstream of a protein coding sequence (e.g. a 
reporter gene) at a position in which it is believed to be 
operatively linked to that coding sequence. A suitable host 
is transformed with the resulting vector. The presence or 
absence of the coding sequence expression product; in the 
presence of the inducing molecule, is determined. For 
putative transport proteins or catabolic enzymes identified 
by homology, function can be confirmed as described below. 

As an alternative, or in addition to, homology 
screening, operon proteins which have catabolic enzymic 
activity can be screened for by their activity. -For 
instance by contacting substrates for the enzymes (the 
analytes) with the host cells , or extracts therefrom, and 
observing for degradation products. 

This approach can be used when the enzyme concerned 
may be successfully expressed in the recombinant host cell-. 
For example, the R. corallina mac operon was isolated by 
screening recombinant E. coli for expression of a catechol 
2, 3-dioxygenase activity induced in R. corallina when grown 
on monoaromatic compounds such as toluene. The substrate of 
the enzyme is catechol, a water soluble 2 hydroxyphenol 
which does not lyse E . coli . 

However, other potential subs trates/analytes e.g. 
toluene are highly toxic to S. coli and may cause its 
membrane to destabilise leading to cell lysis. 
Additionally, gene isolation by function is limited to 
those genes that are expressed in the test bacterium. 
Because of their evolutionary distance from the mycolic 
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acid bacteria, established cloning hosts such as E. coli or 
Gram-positive bacteria such as Bacillus subtilis and 
Staphylococcus aurf^n<=i may not effectively recognise mycolic 
acid bacterial gene regulatory signals and/or may not 
transport or survive in the presence or xenobiotics per se. 
Therefore, isolation by acquisition of novel -phenotype 
cannot easily be accomplished in these hosts. 

In addition, when screening for proteins involved in 
binding or transporting the analyce, or transducing this 
binding event to the inducible promoter (e.g. transcription 
factors) , it may be necessary to use a host in which other 
elements of the entire system (i.e. promoter and/or signal 
gene or catabolic enzymes) are present in order to 
demonstrate activity. 

order to circumvent these problems, in a most 
preferred embodiment of the second aspect, vectors 
comprising the inducible promoter and/or operon proteins 
are identified by means of a functional screen in a second 
host. This can avoid the difficulties described above. 
20 Preferably this second host is a suitable mycolic acid 

bacterium . 

In order that the vectors can be maintained in the 
mycolic acid bacteria, they must encode replicons which can 
function in mycolic acid bacteria. These replicons can be 

2 5 those known in the art (e.g. based on characterised mycolic 

acid bacterial plasmids pSRl (Batt et al . , 1985). 
Alternatively the present inventors have provided a novel 
method of generating supercoiled or circular plasmid DNA 
from mycolic bacteria, and this method forms one part of 
the present invention. The diversity of the mycolic acid 
bacteria means that it is unlikely that a single replicon 
will be sufficient to construct biosensors in all strains 
encountered. Novel replicons which can be used either alone 
or in conjunction (two or more per vector) with other 

3 5 replicons to expand host range therefore provide a useful 

contribution to the art. 

Thus, using the supercoiled/plasmid method of DNA 
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isolation detailed in Example 4,. two previously 
uncharacterised plasmids pRClOO and pRC158 have been 
discovered in soil mycolic acid bacteria Rhodococcnc:: 
corallina and mycolic acid bacterium strain RC158 
respectively . 

Strain RC158 contains a supercoiled plasmid of 
approximately 14.57 kb . The plasmid, designated pRC158, 
contains at least five EcoRI restriction enzyme sites which 
can be used to digest the plasmid into a specific 
restriction pattern of five major restriction fragments of 
^-3, 2.9, 2.2 and 1.6 Kb DNA respectively. An 
approximately 100 kb plasmid, pRClOO, was isolated from R. 
corallina 

Replicons may be identified from novel plasmids by 

15 screening fragments obtained therefrom in disabled vectors 

containing marker proteins (for instance based on pJP7 
described below) to see if they can replicate in mycolic 
acid bacteria. 

Novel plasmids isolated using the method, and novel 

20 replicon elements isolated from them, form a fourth aspect 

-of the present invention. These, and existing replicons, 
may be used to construct cloning vectors v/hich replicate in 
several mycolic acid bacterial strains. Thus it is 
possible to clone, isolate by function and express specific 

25 genes from not only a single "type strain" as is the common 

practice in molecular biology but also in a variety of 
mycolic acid bacteria. 

It is preferable that the transfer of the vectors 
comprising the putative inducible promoters and/or operon 

30 proteins to the second host (preferably mycolic acid 

bacteria) from the first host (preferably an established 
cloning systems such as E. coli ) be achieved using 
bacterial conjugation. Experiments have shown that 

restriction enzyme activity in newly isolated mycolic acid 

35 bacteria effectively limits the efficiency of 

electroporation of incorrectly methylated plasmid DNA to 
very low, or undetectable levels. It is known that most 
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restriction enzymes preferentially act on double stranded 
DNA substrates. It is known that conjugative DNA transfer, 
however, involves a single-stranded DNA intermediate and is 
thus relatively immune to restriction. It is known that 
5 the IncPa conjugative plasmid RP4 can transfer its DNA into 

a wide range of bacteria by conjugation. Accordingly, a 
series of conjugatively mobilizable mycolic acid bacteria 

/ E-i coli shuttle vectors have been constructed by 

incorporation of a 440 bp region of the RP4 plasmid 

10 encoding the origin of transfer (pJP8 figure 1) 

Experiments have shown that RP4 oriT vectors can be 
complemented in trans for tra functions allowing 
conjugative mobilization into a variety of mycolic acid 
bacteria at high efficiency. 

15 The vectors for use in the most preferred embodiment 

of second aspect of the invention (i.e. functional 
screening in a second host) , themselves form a fifth aspect 
of the present invention, such vectors typically 
comprising: 

20 (a) a replicon for mycolic acid bacteria 

(b) a replicon for E. coli 

(c) a conjugative origin of transfer 

(d) a lambda cos site 

An example of such a vector is that termed pJP8 
25 (Figure 5) . This comprises (a) pCY104oriV, (b) pBR322 oriV 

(c) RP4 oriT , and (d)a cos site; however it will be 
apparent to those skilled in the art that any of these 
could be substituted for a sequence having similar 
function, for instance substituting pRClOO or pRC158 
30 minimal replicon sequences for the novel pCY104 replicon. 

In use such vectors will further comprise a fragment 
containing the putative inducible promoter and/or operon 
proteins and optionally a signal protein, such as have been 
described above. 

^5 Thus a gene library can be constructed in a 

mobilizable cosmid shuttle vector such as pJPB . After in 
vitro packaging, cosmids can be recovered by adsorption to 
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E. coli carrying mcrABC mrr hsdSRM recA recO. Given Che 
size of the mycolic acid genome (approximately 4 Mb) a 99% 
confidence gene library requires approximately 2500 
colonies . 

To screen for specific functions (either a complete 
reaction pathway or specific reactions) the packaged 
cosmids may be adsorbed to E. coli mcrABC mrr hsdSRM recA 
recO containing an IncP plasmid such as RK2 . Since the RK2 
plasmid encodes several antibiotic resistance genes, it is 
modified by random mutagenesis to disable antibiotic 
resistance genes which are also used as markers in the 
cosmid vector. From this transformed strain, the 

mobilizable cosmid shuttle vector may be conjugated into a 
wide variety of mycolic acid bacteria for functional 
screening. In any such screen, the choice of mycolic acid 
bacterial strain will be governed by the known catabolic 
functions of the strain. Thus entire pathways may be 
isolated by screening for gain of function. Alternatively, 
if a particular strain is known to require only one or a 
few catabolic activities these may be screened for by 
complementation . 

By incorporation of a signal gene adjacent to the 
cloning site in pJP8 used to construct the gene library, 
transconjugant mycolic acid bacteria can be screened for 
inducible expression of a signal protein such as luciferase 
in the presence of specific molecules. This will rapidly 
isolate environmentally responsive 

promoter/operator/regulator elements . 

Once identified, by any of the methods of the second 
aspect of the invention above, the putative inducible 
promoter and/or operon proteins may be modified by 
subcloning mutagenesis (typically within E. coli ) and 
screened for enhanced function in mycolic acid bacteria. 

The term 'modified' is used to mean a sequence 
obtainable by introducing changes into the full-length or 
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part -length sequence, for example substitutions, 
insertions, and/or deletions. This may be achieved by any 
appropriate technique, including restriction of the 
sequence with an endonuclease followed by the insertion of 
5 a selected base sequence (using linkers if required) and 

ligation. Also possible is PCR-mediated mutagenesis using 
mutant primers . 

It may, for instance, be preferable to add in or 
remove restriction sites in order to facilitate further 
10 cloning. 

Alternatively, it may be parcicularly desirable to 
modify the binding protein/agent m order to modify its 
specificity and/or affinity for analyte . 

Modified sequences according to the present invention 
may have a sequence at least 70% identical to the sequence 
of the full or part-length inducible promoter or operon 
protein as appropriate.. Typically there is 80% or more, 
90% or more 95% or more or 98% or more identity between the 
modified sequence and the authentic sequence. There may be 
up to five, for example up to ten or up to twenty or more 
nucleotide deletions, insertions and/or substitutions made 
to the full-length or part length sequence provided 
functionality is not totally lost. 

Modified promoters and/or operon proteins can be 
25 screened for functionality as described above in relation 

to isolating novel elements. 

Nucleic acid encoding the authentic or modified 
promoter and/or genes encoding the operon proteins (plus 
such modified proteins themselves) identified or obtained 
by the method of the second aspect of the invention form a 
sixth aspect of the invention. 
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Thus one embodiment of the sixth aspect is the R. 
corallina mac locus described in Figures 3 and 4 including 
3 5 the promoter and individual operon proteins encoding 

therein, and modifications thereof. 

The authentic or modified promoter identified or 
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obtained by the method of the second aspect of the 
invention may be used to inducibly express a heterologous 
signal protein in a transformed host; this use forms a 
seventh aspect of the present invention. 
5 In one embodiment of the seventh aspect, there is 

disclosed a method of transforming a hose with a vector 
encoding the inducible promoter as described above, 
operably linked to the signal gene (e.g. encoding 
luciferase) . 

The vector used in the sevench aspect may remain 
discrete in the host. Alternatively it may integrate into 
the genome of the host. 

For a potential host (e.g. Corynebacterium) which does 
not express or generate the other components of the system 
which may be required to give biosensor function (for 
instance the operon proteins such as the transport protein 
to transport analyte into the cell; binding protein to bind 
analyte thereby inducing the promoter activity; cofactors 
required for signal protein activir-.- etc.) these components 
can be added exogenously in order to perform the methods of 
the first aspect, or can be encoded on the vector used to 
introduce the inducible promoter. Indeed, as stated above, 
any transport and binding proteins required for the 
functionality of the inducible promoter will frequently 
25 form part the operon containing the promoter, and may thus 

be identified and isolated alongside it using the methods 
disclosed above. 

Preferably, however, the host (e.g. a mycolic acid 
bacterium, either the same or different to that which 
provided the source of the inducible promoter, but 
preferably the same) will itself naturally express the 
other components of the system required to give biosensor 
function . 

Indeed in this latter case, the signal protein gene 
may be introduced into the host such that it is operably 
linked to an existing inducible promoter. In this 
embodiment of the seventh aspect of the invention the 



20 



30 



35 



15 



or 



10 



15 



25 



30 



identification and or isolation of the promoter 
associated proteins as described above ultimately provides 
the information required to allow targeting of the gene 
into this region. Typically this will be achieved by 
initiating targeted integration using aspects of the 
sequence forming part of the promoter region or operon. 

Direct integration of a signal gene system such as 
lucif erase (e.g. luxAB operon) into an environmentally 
responsive regulon in a mycolic acid containing bacterium 
may be more efficient than approaches based on isolation of 
gene{s) and its/their characterisation followed by 
construction of the biosensor. This integration can be 
achieved by transposition or by illegitimate or legitimate 
recombination between a genetic construct introduced into 
the cell and the target operon or gene cluster located on 
either the chromosome or an episomal element. In 
situations where a gene cluster or operon has been, 
identified as above, by either screening in E. coli or 
direct functional cloning in a mycolic acid bacterium, 
20 site-specific recombination may be used to direct 

integration of the signal gene{s) (such as lucif erase) into 
the regulon. 

Vectors for use in the seventh aspect of the 
invention, form an eighth aspect of the invention. Such 
vectors will typically include: (a) the signal gene, plus 
(b) the inducible promoter, operably linked to the signal 
gene, or a sequence capable of initiating recombination of 
the signal gene such that it becomes operably linked with 
the inducible promoter. Further operon proteins (optionally 
modified) may also be included in the vector. 

Vectors of the eighth aspect of the invention can be 
readily constructed on the basis of the present disclosure, 
for instance based on pJP7 (Figure 6) which is described in 
more detail below. A ninth aspect of the invention is a 
3 5 (biosensor) host transformed with the vectors of the eighth 

aspect . 

In using the transf ormants of the ninth aspect in the 
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methods of the first aspect, the signal (such as bacterial 
luciferase) may be detected extracellular ly using a 
photomultiplier or photodiode or any other photosensitive 
device. This maintains the cell integrity and thus 
resistance to environmental shock. 

Also embraced within the scope of the present 
invention are kits for performing the various aspects of 
the invention. For instance a kit suitable for use in the 
first aspect may comprise a preparation of che 
microorganism, plus further means for carrying out the 
contact or observation steps e.g. buffers, co- factors (e.g. 
luciferin for addition to luciferase) . A kit for performing 
the second aspect may include any of the following: 
selective buffer, a non-ionic detergent, any means for 
15 carrying out the screening process (e.g. primers, probes, 

substrates, for catabolic enzymes, vectors for transfer into 
a second host) . Kits for performing the seventh aspect may 
include vectors for generating biosensors plus other means 
for transforming hosts with them (e.g. buffers etc.). 
20 The invention will now be further described with 

reference to the following non-limiting Figures and 
Examples. Other embodiments falling within the scope of the 
present invention will occur to those skilled in the art in 
the light of these. 

25 



17 



Ficaires 



Figure 1 - shows an agarose gel on which digestions of 
the novel plasmid pRClOO has been run, as described in 
Example 5. 

Figure 2 - shows an agarose gel on which digestions of 
the novel plasmid pRC158 has been run, as described in 
Example 5 , 

Figure 3 - shows a schematic view of the R . corallina 
mac operon obtained by functional screening in E. coli . as 
described ■ in Example 7. The schematic shows location of 
predicted genes Regulator REG, Transport ' TRANS, 
Monooxygenase MONO, Hydroxymuconic semialdehye hydrolase 
HMSH, Alcohol dehydrogenase ADH, and confirmed gene 
Catechol 2, 3 -dioxygenase CDC. Initiator and terminator 
codons are shown as half height and full height lines 
respectively. The location of predicted promoter regions 
and direction are indicated by arrows. The molecular 
weights and coordinates of open reading frames are 
tabulated . 

Figure 4 - shows the complete listing of the R. 
corallina mac operon as described in Example 7. It includes 
a portion of a putative nitropropane promoter (5' of the 
regulator) . 

Figure 5 - shows a schematic diagram of the pJPB 
vector of the present invention, as described in Example 8 , 
Plasmid size is about 8.51 kb . pJP8 is a mycolic acid 
bacterium - E. coli mobilizable cosmid vector. It carries 
pCY104 replion; is Kanamycin resistant 15 fig/ml mycolic 
acid bacteria, 50 /xg/ml E. coli . It also carries lambda cos 
site, RP4 oriT site and a multiple cloning site. 

Figure G - shows a schematic diagram of the pJP7 
vector of the present invention, as described in Example 9. 
Plasmid size is about 10.66 kb . pJP7 is a mobilizable E. 
coli./Rhodococcus/Nocardia suicide/lucif erase integration 
vector encoding luxAB signal genes, sacB gene and 
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thiostreppton resistance in Rhodococcus/Nocardia only up to 
75 fig/ml (typically 1-10 ^g/ml used in selections) . The 
vector is RP4/RK2 mobilizable . By cloning a region of 
homology into the region upstream of the luxAB cassette, 
5 insertion can be targeted . 
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Example 1 - A novel me d ium for oliaotiroTDhic screenincr 

"MMRN" is prepared as a mul t icomponent stock to avoid the 
production of uncharacterised compounds during autoclaving. 
A "basic salts" stock is prepared' containing- 6g/L Na2HPO^ ; 
3g/L KH^PO,; Ig/L NaCl ; 4g/L (N^-l^SQ; adjusted to pH 7 . 4 
and made up to 989 mis with distilled water and autoclaved. 
A "lOOx A salts", solution is prepared consisting of 20g/L 
MgSO,; 2000 mg/L FeSq,7EO; 200 mg/L FeCl ; 200 mg/L MnSO 
4-H 2O is prepared in distilled water and autoclaved. A 
"lOOOx B salts" solution consisting of 500 mg/L ZnSO, .7H.O; 
15 200 mg/L CUCI2.2HQ; 200 mg/L Na^Q.l^OKO^ 100 mg/L 

(NH4J7M06O24 .4H2O is prepared in distilled water and 
autoclaved. To prepare 1 litre of MMRN, sterile solutions 
of 989 mis basic salts, 10 mis 100 x A salts, 1 ml 1000 x 
B salts are combined. For solid' media, agar is added to 
20 1.4% w/v. Carbon-energy sources are supplemented to 80 

final concentrations for soluble molecules, or as vapour 
for insoluble molecules (where their concentration is 
decided by their individual partition coefficients 
generally ranging from 3 to 40 fiM) . Petri plates or liquid 
cultures are incubated at 28°C to 30^C for up to 72 hours 
to accumulate sufficient biomass for genetic and 
biochemical testing . 
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Example 2 — = — Isolation of n ovel strains of mvcolic acid 
containincr bacte ria from environmental samples using an 
oliaotrophic screen and MMRN 



Novel strains are a source of genetic diversity from 
which biosensors specific for particular xenobiotic 
3 5 compounds can be constructed. To isolate mycolic acid 

bacteria, for example Rhodococcus / Nocardia . from an 
environmental matrix such as soil, a rapid isolation 
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technique is required. Isolation of bacteria from soil 
using standard laboratory media containing eutrophic levels 
of carbon preselects for eutrophic bacteria which can grow 
rapidly under these conditions, Oligotrophic bacteria such 
5 as Rhodococcus / Nocardia are rarely successfully isolated 

on such rich media. This can be carried out using MMRN to 
specifically enrich for and subsequently purify strains of 
mycolic acid-containing bacteria which encode catabolic 
pathways whose expression is induced by a given xenobiotic. 

10 This methodology identifies molecules which are noc only 

substrates, but are necessary and sufficient to induce the* 
appropriate catabolic pathway. Soil suspensions from a 
matrix likely to express a desired phenotype (for instance 
a site known or believed to have been- contaminated with a 

15 particular xenobiotic) can be used to inoculate MMRN 

supplemented with an oligotrophic level of a easily 
utilised carbon source (50/zM) . This provides an initial 
oligotrophic screen. Oligotrophic mycolic acid-containing 
bacteria are slow grov/ing and may be expected to have 

20 formed colonies after 72 hours incubation at 28°C on MMRN 

paraffin. The incubation temperature appears to be highly" 
selective of soil Nocardioform bacteria; Petri plates 
incubated at temperatures above 30°C fail to show^ 
detectable colonies. Colonies growing on alkanes can be 

25 initially screened for Nocardioform phenotype, selecting 

for crumbling, crenellated colonies, (possibly mucoid on 
rich media) . Gram- and Ziehl-Neelsen-staining tests 
rapidly identify Gram-positive, mycolic acid-containing 
bacteria (Place a slide carrying a heat fixed film on a 

30 slide carrier over a sink. Flood with carbol fuchsin 

solution (basic fuchsin 5g; phenol, crystalline, 25g; 95% 
or absolute ethanol 50 ml; distilled water 500 ml) and heat 
until steam rises. .Leave for 5 minutes, heating 

occasionally to keep the stain steaming. Wash with 

3 5 distilled water. Flood slide with 20% v/v sulphuric acid; 

wash off with distilled water, and repeat several times 
until the film is a faint pink. Finally wash with water. 
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Treat with 95% v/v ethanol for 2 minutes. Wash with 
distilled water. Counterstain with 0.2% w/v malachite 
green. Wash and blot dry. Acid and alcohol fast organisms 
are red, other organisms are green) 
5 Mycolic acid-containing bacteria may then be screened 

for specific hydrocarbon-inducible catabolic pathways using 
MMRN supplemented with the target xenobiotic pollutant. 
Strains for which the target molecule is growth inducing 
may then be isolated and used to as a source of genetic 
10 regulatory elements for biosensors or as specific 

biocatalytic functions. Using this protocol mycolic acid 
containing bacteria have been and may be rapidly identified 
with novel and useful catabolic properties. This approach 
is also useful for identification and isolacion of mycolic 
acid containing bacteria with biocatalytic properties. 

Example 3 - Method for isolation o f total nKTA. from my r-r^lir^ 
acid bacteria 
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Bacterial strains were inoculated into 10 mis of MMRN 
supplemented with 500mM glucose 2% w/v L-glycine and 
incubated at 28°C for 30 to 40 hours. This medium supports 
relatively rapid growth of mycolic acid bacteria cells. 
The L-glycine present is misincorporated into pept idoglycan 
cell wall substantially weakening its resistance to osmotic 
shock (Katsumata, et al . , 1984). Growth on MMRN appears to 
enhance the uptake of. L-glycine and its apparent 
misincorporation into the cell arabinogalactan . During 
this growth phase, mycolic acid bacteria produce 
30 extensive surfactants which cause the accumulated biomass 

to clump into pellicles and exhibit a strong surface 
tension effect. These pellicles, which are highly 

resistant to lysozyme, may be broken up and the 
concentration of biosurf actants substantially reduced by 
washing the cell pellet in several culture volumes of 10 mM 
Tris pHB.O; 0.1% Tween 80 and finally resuspended in 1ml of 
10 mM Tris HCl pH8 . 0 , containing 10 mg/ml lysozyme. The 
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lysozyme reaction is incubated 60 to 100 minutes at 37^c 
depending on the strain involved. Lysis is achieved by 
addition of 2% final (w/v) sodium dodecyl sulphate at 60°C 
40 minutes. The nucleic acids are selectively purified from 
the cellular debris by sequential phenol, phenol: 
chloroform lisoamyl alcohol (50:48:2 v/v) extractions. 
Nucleic acids are concentrated by ethanol precipitation in 
2 M ammonium acetate. The nucleic acid pellet recovered is 
washed with 70% ethanol and resuspended in 100^1 10 miM 
Tris.HCl pH8.0, ImM EDTA . 2 ^1 of this sample may be 
digested using restriction enzymes. 

Example 4 - Method to isolate suoercoiled/circular' plasmid 
DNA from mvcolic acid bacteria 

50 mis Rhodococcus was cultured to mid- logarithmic 
phase in MMRN supplemented with 2% w/v L-glycine, 2% w/v 
D-glucose. 

The cell pellet was washed in 10 rruM Tris pHB . 0 and 0.1% 
Tween 80. Resuspend cell pellet in 7.6 ml 6.7% sucrose; 50 
mM Tris.HCl; 1 mM EDTA. Add 2 ml 40 mg/ml lysozyme in 10 
miM Tris.HCl 1 mM EDTA. Incubate 37°C 15 minutes. Add 970 
Ml 250 mM EDTA, 50 mM Tris.HCl pH 8.0. Continue incubation 
for a further 105 minutes 37 °C. Lyse cells by addition of 
600 /il 20% SDS 50 mM Tris.HCl, 20 mM EDTA pH 8.0. Incubate 
55 °C 3 0 minutes. Shear lysate by vigorous vortexing 3 0 
seconds. Denature DNA by addition of 560 fil freshly 
prepared 3 M NaOH followed by gently mixing 10 minutes room 
temperature. Neutralise by addition of 1 ml 2 . 0 M Tris .HCl 
pH 7.0 with gentle mixing 10 minutes. Add 2.1 ml 20% SDS 
50 mM Tris.HCl, 1 mM EDTA. Mix gently. Add 4.2 ml ice 
cold 5 M NaCl . Incubate on ice overnight or for several 
hours at least. Clear the cellular debris by 

centrifugation at 48000 g 4°C 90 minutes. The supernatant 
contains the DNA. Decant the supernatant by addition of an 
equal volume of ice cold isopropanol . Incubate -20*'C 30 
minutes. Pellet nucleic acids 4°C, lOOOOg 20 minutes. 
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Example 5 : Novel plasmids and replicons obtained by the 
method of Example 4 

Two multicopy plasmid replicons were isolated using 
5 the method of Example 4; pRC158 from strain RC158 and 

pRClOO from R. corallina . 

Both plasmids have been digested with restriction 
enzymes to produce characteristic restriction patterns 
(Figures 1 and 2) . 

10 Plasmid pRClOO, an approximately lOOkb supercoiled 

circular plasmid present in R. corallina was prepared as 
described in the text. The agarose gel was loaded in lane 
1 with Lambda DNA Hindlll size markers (23,130 bp; 9,416 
bp, 6,557 bp, 4,361 bp, 2,322 bp, 2,027 bp, 564 bp); lanes 

15 2 to 9 inclusive were loaded with pRClOO digested with 

BamHI (5 ' GGATCC3 ' ) , Bell ( 5 ' TGATCA3 ' ) , Bglll ( 5 ' AGATCT3 ' ) , 
EcoRI (5'GAATTC3' ) , Hindlll ( 5 ' AAGCTT3 ' ) , Kpnl 

5 ' (GGTACC3 ' ) , Sad ( 5 ' GAGCTC3 ' ) , Sail { 5 ' GTCGAC3 ' ) 
restriction endonuclease reactions . v/hich were carried out 

20 under standard conditions; lane 10 contains undigested 

(presumable supercoiled) pRClOO DNA; lane 11 pWWllO/4 0 1 2 1 , 
lane 12 pWWllO/4011; lane 13 pWW15/3202; lane 14 pUC18 lane 
15 blank. The DNA fragments have been resolved on a 0.8% 
Agarose Tris-Acetate-EDTA gel. Southern blotting analysis 

25 using Gram-negative mono and polyaromatic catechol 2,3- 

dioxygenases failed to detect significant sequence 
conservation - 

Plasmid- pRC158 is a supercoiled plasmid of 
approximately 14.57 kb . The plasmid was digested with the 

30 EcoRI (5'.GAATTC3 ' ) restriction endonuclease under standard 

conditions. The DNA fragments have been resolved on a 0.8% 
Agarose Tris-Acetate-EDTA gel. This pattern is unique and 
characteristic to pRC158 . The plasmid contains at least 
five EcoRI restriction enzyme sites which can be used to 

35 digest the plasmid into a specific restriction pattern of 

five major restriction fragments of 4.3, 3.3, 2.9, 2.2 and 
1.6 Kb DNA respectively. 
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These plasmids are relatively small, exhibit a high 
plasmid copy number and are easily isolated from 
Rhodococcus / Nocardia . Therefore, they possess several 
characteristics which are suitable for the construction of 
5 Rhodococcus / Nocardia cloning vectors. 

The DNA sequence of the minimal replicon regions of 
these plasmids may be determined by screening fragments 
obtained therefrom in disabled vectors concaining marker 
proteins (for instance based on pJ?7 described below) to 
10 see if they can replicate in mycolic acid bacceria. 

Example 6: Hybridisation screening for novel promoters 
and/or operon proteins 

15 The test sample (host cells) are contacted with a 

nucleic acid molecule probe (preferably around 100 
nucleotides or more) based on Figure 4 under suitable 
hybridisation conditions, and any test DNA which hybridises 
thereto is identified. Such screening is initially carried 

20 out under low- stringency conditions, which comprise a 

temperature of about 37°C or less, a formamid4 
concentration of less than about 50%, and a moderate to low 
salt (e.g. Standard Saline Citrate ('SSC') = 0.15 M sodium 
chloride; 0.15 M sodium citrate; pH 7) concentration. 

25 Alternatively, a temperature of about 50°C or less and a 

high salt (e.g. ^SSPE'= 0.180 mM sodium chloride; 9 mM 
disodium hydrogen phosphate; 9 mM sodium dihydrogen 
phosphate; 1 mM sodium EDTA; pH 7.4). Preferably the 
screening is carried out at about 37°C, a form amide 

30 concentration of about 20%, and a salt concentration of 

about 5 X SSC, or a temperature of about 50°C and a salt 
concentration of about 2 X SSPE. These conditions will 
allow the identification of sequences which have a 
'substantial degree of similarity with the probe sequence, 

3 5 without requiring the perfect homology for the 

identification of a stable hybrid. The phrase 'substantial 
similarity refers to sequences which share at least 50% 
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overall sequence identity. Preferably, hybridisation 
conditions will be selected which allow the identification 
of sequences having at least 70% sequence identity with the 
probe, while discriminating against sequences which have a 
lower level of sequence identity with respect to the probe. 
After low stringency hybridisation has been used to 
identify several clones having a substantial degree of 
similarity with the probe sequence, this subset of clones 
is then subjected to high stringency hybridisation, so as 
to identify those clones having a particularly, high level 
of homology with respect to the probe sequences. High 
stringency conditions comprise a temperature of about 42 °C 
or less, a foorm amide concentration of less than about 20%, 
and a low salt (SSC) concentration. Alternatively they may 
15 comprise a temperature of about 65°C or less, and a low 

salt (SSPE) concentration. Preferred conditions for such 
screening comprise a temperature of about 42 °C, a form 
amide concentration of about 20%, and a salt concentration 
of about 2 X SSC, or a temperature of about- 65°C, and a 
20 salt concentration of about 0.2 SSPE. 

Example 7 ~ Clonin g aromatic degradative operon from 

Rhodococcus co rallina by functional screening in E coli 

'^o demonstrate the potential mycolic acid bacteria 
^^•9' Rhodococcus / Nocardia) have as biosensors and 
biocatalysts as well as to validate the novel genetic tools 
and approach to cloning of the present invention, a gene 
cluster or operon associated with aromatic degradation was 
cloned and isolated from Rhodococcus corallina . This gene 
cluster / operon appears to be a broad substrate range 
monoaromatic degradative pathway and has been designated 
monoaromatic catabolic ( mac) gene cluster or operon. R. 
corallina was isolated from pristine soil in Canada and is 
3 5 an acknowledged Rhodococcus type strain. This strain 

encodes a broad range of catabolic activities which include 
toluene, benzoate, phenol, cumine, cyamine . Genetic 
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induction of the toluene degradative pathway in R . 
coral lina occurs when toluene is supplied as vapour. This 
is a level of less than 200 ppm in water. Therefore, the 
sensitivity inherent in the biology of Rhodococcus is very 
close to those levels expected for biosensors in industrial 
use. Similar experiments using a naphthalene utilising 
Rhodococcus which is also supplied as a vapour 

Biochemical assays of ri.ng cleavage dioxygenase 
activities in crude enzyme extracts of R. corallina cells 
grown on MMRN supplemented with different growth- inducing 
xenobiotics indicated that the .rr.olecular specificity of 
ring cleavage dioxygenase induction is good. Toluene 
• induced the meta pathway (although some ort ho activity v/as 
observed) whereas benzoate and phenol exclusively induces 
the ortho pathway. Xylene, which is very closely related 
to toluene does not act as a growth inducing substrate. 
The closely related compounds toluene and benzoate but not 
xylene induce different ring-cleavage enzymes despite their 
relatively similar molecular shape. This behaviour and 
absence of induction with xylene suggests that the receptor 
for these or metabolites derived from these molecules is 
sensitive to minor electrostatic changes in their ligand. 
This strongly asserts that genetically constructed 
biosensors derived from these receptor molecules and their 
regulated promoter (s) will exhibit a level of specificity 
which exceeds that currently available as field test 
systems . 

Since a clear catechol 2 , 3 -dioxygenase activity was 
induced by toluene, but not by benzoate (indicating that 
the meta pathway in this strain is specifically induced by. 
toluene), the catechol 2 , 3 -dioxygenase activity can be used 
as a marker for gene(s), gene cluster(s) or operon(s) 
involved in its degradation. 

The R. corallina catechol 2 , 3 -dioxygenase structural 
gene was isolated by functional screening of a partial 
Sau3A restriction enzyme digest -generated gene library in 
CQli hsdRMmcrAB for using the commercially available 



cosmid cloning vector pWElS (Wahl et al . , 1987). 

Because only a single enzyme activity, has been used as 
a functional marker rather than complete acquisition of a 
phenotype and given the diversity of Rhodococcus / Nocardia 
5 metabolism and the genetic incompatibility between mycolic 

acid bacteria and E. coli it is possible that numerous 
catechol dioxygenases may exist but only some will be 
expressed successfully in E. coli . To facilitate 

expression of cloned DNA irrespective of the presence of an 

10 indigenous promoter element, a phage T7 promoter is located 

adjacent to the pWElS unique BamHI restriction site into 
which the rhodococcal DNA was inserted. Phage T7 RNA 
polymerase (a single polypeptide) is supplied in trans from 
pGPl-2Sm. As a functional screen for 2 , 3 -dioxygenase 

15 activity, catechol was sprayed onto nutrient agar plates 

supplemented with 15 fig/ml kanamycin, 50 fJig/ml 
streptomycin, 0.1 mM isopropyl thiogalactoside (IPTG) 
incubated at 3 0°C to accumulate biomass . The expression of 
T7 polymerase is repressed by temperature sensitive phage 

20 lambda repressor which is itself expressed from an IPTG 

induced lacUV5 promoter. Thus incubation at 42 °C leads to 
induction of T7 polymerase expression and so transcription 
of the pWE15 insert region from the T7 promoter (i.e. one 
direction of transcript alone) . 

25 Using the pGPl-2Sm T7 expression system, two colonies 

were isolated which encoded the characteristics catechol 
2 , 3 -dioxygenase activity from R. corallina . From 
approximately 3 00 0 colonies of individual primary clones of 
R. corallina gene library in an E. coli hsdRMmcrAB strain, 

3 0 two colonies were observed to produce a deep yellow colour 

indicative of catechol . 2 , 3 -dioxygenase activity 
( 2 -hydroxymuconic semialdehyde) when exogenous catechol was 
supplied in phosphate buffer (O.IM pH7.4). These clones 
were designated clone #1 and clone #2. Restriction enzyme 

3 5 mapping of both clone #1 and clone S2 DNA showed that both 

encode overlapping regions of DNA but were otherwise 
nonsibling clones; this is compatible with a primary 



28 



screening of a cosmid library. 

Southern blot analysis of R. corallina total cellular 
and plasmid DNA confirmed that the isolated catechol 
2 , 3 -dioxygenase locus in clones #1 and #2 are contiguous 
5 with an approximately 3 5 kb region R. corallina . genomic 

DNA. The common region to both clones is comprised of 
seven major EcoRI restriction fragments (8.3, 7.2, 5.2, 
4.9, 4.3, 2.4, 2.3 Kb respectively 34.6 kb in total). To 
confirm the continuity and source of the clone #1 and 

10 clonetf2 inserts, an aliquot of clone #'2 DNA, which 

contained a slightly longer R. corallina DNA insert chan 
clone #1, was used as a source of DNA co synchesise a 
radioactive probe to identify homologous DNA restriction 
fragments present in an EcoRl restriction digest of total 

15 cellular R. corallina DNA as well as other bacterial DNA 

samples. An randomly picked pWElS clone which did not 
express catechol 2 , 3 -dioxygenase was chosen as one control 
(cosmid clone "clone # 4") and E. coli genomic DNA were 
selected as control DNAs . At the level of accuracy of the 

20 gel, the coincidence of the catechol 2 , 3 -dioxygenase clones 

#1 and clone #2 DNA inserts relative to the genomic RJ;' 
corallina EcoRI and Smal restriction maps indicated that no 
gross deletions or rearrangements had occurred during the 
cloning. Significantly, there was no evidence for a 

25 supercoiled plasmid location for the catechol 

2 , 3 -dioxygenase gene indicating that . the locus is 
chromosomally encoded (although pRClOO has been isolated 

f^om R^ corallina (see Figure 1) this strain does not 

encode large linear plasmids) . To investigate the 

30 potential for gene homologs to be identified a Rhodococcus 

strain RC161 v/hich was isolated from North East England and 
so is distinct from R. corallina (v/hich also degrades 
toluene via meta cleavage but was isolated form soil in 
Canada) was included in the Southern Blot. There were 

35 three RC161 EcoRI restriction fragments which exhibited 

significant DNA sequence conservation with R. corallina 
sequences in clone #2. The nature of these sequences is 
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under investigation. 

Colony hybridisation to the R. corallin;. gene library 
secondary screen using the 2.4 Kb EcoRI restriction 
fragment of clone #2 as a source of radioactive probe 
identified four cosmid clones, pWE15#C, pWE15#D, pWE15#B 
and pWEl5#G encoding overlapping regions of the 
corallina chromosome. Thus a region of the R.corall in^ 
genome with a contiguous length of approximately 70 kb has 
been cloned and isolated. These cosmids will provide a 
source of R. corallina DNA for future experiments. 

The 3 5 Kb region encoded by clones Hi and #2 was 
mapped using four six base recognition restriction enzymes. 
An analysis of the map does not indicate inverted "DNA map 
elements which could be consistent with a transposable 
15 element. This does not, however, preclude this possibility 

existing . 

The sequence of the operon is described in Example 9 
below. 
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A key aspect of this invention is the ability to 
genetically manipulate a variety of strains or species of 
mycolic acid-containing bacteria such as Rhodococcu.c, / 
Nocardia in a simple, effective way so as to clone and 
isolate gene(s), gene cluster{s) or operon(s) with 
applications as biosensors or biocatalysis . 

The closely related mycolic acid-containing bacterium 
Corynebacterium glutamirum may be used as a host to express 
Rhodococcuf^ / Nocardia genetic material. C. alutamir^nm 
shares a common cell wall type and probably similar genetic 
regulation to Rhodococcus / Nocardia but since it has been 
used extensively for the industrial production of amino 
acids and nucleotides it has lost or may never had encoded 
significant xenobiotic catabolic activity. it therefore 
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represents a good "naive" host to express Rhodococcus / 
Nocardia genes . 

Restriction enzyme activity in natural isolates of 
Rhodococcus / Nocardia effectively limits the efficiency of 
5 electroporation to very low, or undetectable levels. Most 

restriction enzymes recognise double stranded DNA 
exclusively. Because single-scranded DNA is a necessary 
product of a replication fork, ncrmal' restriction enzyme 
activity in bacterial cells is lir-ired co couble stranded 
10 DNA substrates. Conjugative DNA transfer in Gram-negative, 

and most probably between Gram- pos i c i ve bacteria as well, 
involves a single-stranded DNA intermediate. Conjugative 
DNA transfer should thus, generally, be relatively im.mune 
to restriction. 

15 The pJP8 plasmid may thus be used to introduce the 

library in the first host into a suitable mycolic acid 
bacterium such as corynebacterium or any mycolic acid 
bacterium which does not encode the desired phenotype . 

The pJP8 plasmid is shovm in Figure 5. The shuttle 

20 vector carries a approximately 400 bp region of the Inc? 

RK2 conjugative plasmid which encodes the origin of 
transfer. This may be complemented in trans by IncP tra 
functions maintained on a suitable compatible recombinant 
plasmid, or as an integrated construct in the host 

25 chromosome or by RK2 itself (modified to disrupt its 

kanamycin resistance gene - a marker used for pJP8) . 
Conjugation involves ''effective contact" between the donor 
and recipient cells, which in this case are E coli encoding 
complementing tra functions and bearing the mobilizable 

30 cosmid vector and a suitable mycolic acid bacterium 

respectively. Effective contact is . the formation of a 
cytoplasmic bridge between the two cells through which 
conjugative DNA transfer occurs. Thus donor and recipient 
cells are grown to mid to late logarithmic phase of growth 

35 in Lauria Bertini broth and MT^RN supplemented- with suitable 

carbon source at 37°C and 36 C respectively. Donor and 
recipient cells are washed in prewarmed media and mixed on 
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a solid support matrix such as Lauria Bertini Agar plate 
and incubated at 37^C for up to 16 hours. The mating 
mixture is scraped from the plate and resuspended in 30°C 
Lauria Bertini broth, from which serial dilutions are 
prepared and plated on MMRN agar supplemented with drugs to 
counter select against the donor and recipient and select 
for the transconjugant mycolic acid bacterium. Commonly, 
naladixic acid selects against the donor and kanamycin 
resistance selects against the recipient. Thus, on a 
plates supplemented with both only the transconjugant may 
grow. The plates are incubated at 30^C for 40 hours. 

Example 9 - DNA sequence of the proximal region of R. 
corallina mac locus 



The DNA sequence of approximately 7 Kb of R. corallina 
chromosomal DNA surrounding the mac catechol 
2 , 3 -dioxygenase has been determined using automated dye 
terminator sequencing reactions. A schematic of the 
current state of the data is presented in Figure 3 which 
shows at least seven genes which have been identified by 
protein sequence conservation with known protein motif data 
(nitropropane dioxygenase, a putative regulatory protein 
orfR, monoaromatic monooxygenase , hydroxymuconic 

2 5 semialdehyde hydrolase, catechol 2 , 3 -dioxygenase , alcohol 

dehydrogenase) . 

The sequence of this region in shown in Figure 4 . 
The predicated gene organisation of the mac associated 
region is indicative of the presence of possibly two 
30' different catabolic gene clusters or operons; one involving 

the nitropropane dioxygenase the other the mac gene cluster 
or operon. Such a genetic organisation suggests that a. set 
of divergent promoter elements are located between the 
predicted regulatory gene orfR and the mac monooxygenase 
35 structural gene. Similarly, another promoter could map 

immediately upstream of the divergent open reading frame 
which has conservation to nitropropane dioxygenase. 
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Exam ple 10 - use of the promoter obtained in Example 9 

The R. corallina genes identified by sequence 
conservation or by function are listed in Figure 3 . These 
5 are potentially useful as catalytic functions, in various 

chemical transformations. The regulatory protein associated 
with the putative mac operon (possibly encoded by orfR) is 
involved in the control of transcriptional initiation at 
its target promoter. This regulatory protein encodes the 

10 specificity of the operon and as such is likely to be 

central to the biosensor function. Subcloning of the 
regulatory protein and its target promoter could permit 
novel biosensor activities to be introduced into other 
Rhodococcus / Nocardia strains. In addition, if this 

15 regulatory protein is subjected to mutagenesis, mutants 

with altered function could be identified (using a 
luciferase promoter probe driven by the regulated 
promoter) . The regulatory protein has a specific 

capability to bind its ligand from the environment. It is 

20 therefore potentially useful as a protein adsorbent for 

specific molecules. This could have application in 
analytical chemistry sample preparation. 

An analysis of the 5'. region of the predicted genes 
and the catechol 2 , 3 -dioxygenase reading frame has allowed 

25 us to predict the sequence involved in translat ional 

initiation. These "ribosome binding sites" can be used as 
sequence guides or templates for the creation of synthetic 
oligonucleotides encoding functional Rhodococcus / Nocardia 
translational initiation sites. Mutagenesis of this region 

30 can identify potentially up and down regulating base 

sequences changes . 

The mac promoter region which concrols expression of 
the cloned operon lies between two putative genes (orfR 
regulatory gene and orfT transport gene) . In addition to 

35 forming the basis of a biosensor, the promoter and its 

cognate regulatory system also could be used as an 
inducible expression system for Rhodococcus / Nocardia and 
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other mycolic acid-containing bacteria. The sequence of 
this region encodes the binding sites and regulatory 
elements or operators involved in control of the mac and 
possibly other closely linked genes or operons . This 
region constitutes the first defined sequence for a 
Rhodococcus / Nocardia promoter region. It can be used as 
a probe to identify similar sequences within other mycolic 
acid containing bacteria such as Rhodococcus / Nocardia . 
This promoter sequence could be used as a region of 
homology to drive targeted recombination / insertion of 
signal gene(s) such as Vibrio lucif erase. 

A vector such as pJP7 (Figure 6) may be used as 

follows : 

The vector is a ^suicide vector' which can be used to drive 
expression of bacterial luciferase genes in R. corallina . 
A portion of the mac promoter region (Figure 4) is ligated 
into the unique pJP7 }CbaI restriction site downstream of an 
— coli trp^A transcriptional terminator. The sac B gene 
allows counter selection for the integrated plasmid thus 
20 selecting for a second cross-over within the plasmid 

sequences to produce a gene replacement of the wild type 
gene with an interrupted gene including luciferase . An 
aspect to this technique is the ability to introduce DNA 
constructs into the target cell in a hyperrecombinogenic , 
non-replicating form. Conjugatively mobilised plasmids may 
represent just such a form in that they may be 
single- stranded form. Thus the conjugatively mobilised 
plasmid pjpv which cannot replicate in mycolic acid 
bacteria could be used directly to integrate DNA constructs 
into a wide range of mycolic acid bacterial strains. 

Example 11 - Biosensor 
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The biosensor of the present invention is typically a 
recombinant mycolic acid containing bacteria which may be 
Rhodococcus / Nocardia cell. The natural gene-regulatory 
system which activates expression of catabolic gene(s) , 
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gene cluster (s) or operon(s) in response to the presence of 
specific class or type of inducing naturally-occurring or 
xenobiotic carbon substrate (s) has been genetically 
manipulated to induce the expression of some signal 
gene{s), such as but not limited to the Vibrio or 
Photobacterium bacterial luciferase in the presence of the 
inducer. This manipulation may have involved either 
incorporation of the signal gene(s) into a chromosomally- 
or episomally-encoded regulon under the control of a 
suitable environmentally- regulated promoter, or by direct 
sub-cloning of the regulated promoter to a rhodococcal / 
nocardial plasmid or other replicon or episomal element: 
encoding a promoter- less signal cene(s). The " generic 
manipulation effecting the substitution or supplemenca t ion 
of the natural genes with the signal gene (s) may involve 
integration of the signal gene{s) gene cluster (s) or operon 
into the host chromosome, plasmid or. other episomal element 
so as to place it under inducible regulatory control or 
subcioning of the analyte (particularly 

hydrocarbon) - responsive promoter to a multicopy plasmid. 
The integration may involve site-specific recombination, 
transposition or illegitimate or homology-dr iven DNA 
recombination which is another aspect of this invention; 
however other methods of DNA integration such as the use of 
polymerase chain reaction (PGR) are not ruled out. 

Signal to noise ratio can be readily improved in the 
recombinant system by enhancing or optimising expression or 
function of the signal gene, which may be luciferase, by 
means of improved gene translat ional signals and/or 
increasing levels of transcription by either raising 
transcriptional rates, mRNA stability or gene dosage of the 
construct (by subcioning to a plasmid or iterative gene 
integrations into a chromosome, plasmid or other episomal 
element). Thus, for instance, transcriptional efficiency of 
the luciferase genes lux AB can be increased by substitution 
of the Vibrio translat ional initiation signals with those 
from the mac operon. 
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Pigure : Plasmid DJl3C^solatioii from 5hQdQCQ ggnja%;q yalllna 
Pla6mi4DRC100, an apprba^ijnacely lOOkb supercoiled ^ixcular plasmid px-eaent in r. 
fiflSalliak^s^s prepared as e^acribed in tiie text. The^^arose gel was loaded in 
lone 1 withNlimnbda DNA Hindlliv^ire markers {23,130 bp; >Vs416 bp, 6,557 bp, 4,361 
bp, 2,322 bp, 2vP27 bp, 564 bplOianeo 3 to 9 inclusinve loaded witb pRClOO 

digested witli Banfex^ ( 5 'GOATCCS ' ) . Bbll ( 5 ' TGATCA3 ' ) , HglXI ( 5N4aATCT3 ' ) , EcoRX 
(5'GAATTC3') * EindIII\^5 'AAGCTT3 ' ) , KphJ 5 ' (GGTACC3 ' ) , Sad (5' 



:tc3 ' ) , Sail 

(5'GTCGAC3') reatrictioh-v^andonuclease rectc^ions which were carrie^\out under 
standfljrd conditions; lane iO contains undigested (presumable superceded J pRClOO 
DMA; lane 11 pWWllO/40121, Ian© 12 pWW110/40llKlane 13 pWW15/3202; I^m 14 pCClQ 
lane 15 blank. The DNA fragmenbs have been resolVed on a 0.3% Agajroee Trls- 
Acetate-EDTA gel. South€!xn blott^rixg anaJ-yais us ing^^tjuTM- negative mono an^v 
polyaromatic catechol 2 , 3-di oxygenases failed to detecc\signif leant sequence 
conservation. ^ 
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Figure : ^Plaaciid DNA Isolation from Rhodococcus scrainvHClSQ digested witli 
EcoRI 

Plasmid pRClS8^\an approximately ISkb sui>^coiled circula-r pt^smid present In 
RC158 was prepared, as described in the texfcs^and digested with cfce EcoRI 
(5*GAATTC3') restrxb^ion endonuclease under 3<;aiidard conditions ?V The DNA 
rragments have bean rksolved on a 0.8% AgaroseXris-Acetate-EDTA ^1. This 
pattern is unique and c^racteristic to pRC158 . 
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Figure: Schematic of R . cor;^ 1 1 ina mac operon showing Location of 
predicted genes Regulator REG, Transport TRANS, Monooxygenase MONO, 
Hydroxymuconic semialdehye hydrolase HMSH, Alcohol dehydrogenase ADH, and 
confirmed gene Catechol 2 , 3-dioxygenase CDO. Initiator and terminator 
codons are shown as half height and full height lines respectively. The 
location of predicted promoter . regions and direction are indicated by 
arrows. The molecular weights and coordinates of open reading frames are 
tabulated. 
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Figure: Complete lifting of R. corallina mac operon encoding predicted 
regulatory promoter region, regulatory protein, operon promoter region, 
transport protein, monooxygenase protein, HMSH protein. Catechol 2,3- 
dioxygenase protein. Alcohol dehydrogenase protein sequences. 

1/1 31/11 

GAA TTC CAT GTT CTT CTC CTT GCA TGT GGC CCG CGT ' TGC CGA GGG CAC TGC TCG GCC TGT 
61/21 * 91/31 

CGC CCG CAG AGG GCG CAT GTC CGG GTG CCT GGA TAT GGC GCG TAC GGC GTG CCC TCC GGC 
121/41 promoter region 151/51 

GTT AAC CCC GAG GTT GGC CAC GAT GCC CCG GCC ATC AGC TCT GGA ATG CTA GCG TTC CAG 
181/61 promoter region 211/71 

ACG AAG GTA ACC CAC AGT GAC TCA CAC CAC AAG TAC TAG A-^T GCA AGC TGT TGC GGT GAG 
241/81 promoter region 271/91 Regulator 
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GCA 


CAG 


AGC 


GCC GTG GTG 


CGA 


ATG 


CCG 


ATC 


GCC 


GAA 


GCG 


? O ^ 


A 




V 


K 


A 


0 


S 


A V V 


R 


M 


■D 


T 


A 




A 


2101/701 
















2131/711 
















.TTC CGC AAG 


CAC. TGG 


AAG 


GAA 


ATT 


CTC 


CTC 


ATC GCG GGC 


ACC 


TAC 


CTG 


TCC 


CAA GGA. 


cm. 


F R t; ' 


H 


W 


K 




I 




L 


I A G 


T 


V 


L 


s 


0 


G 


Y 


2161/721 
















2191/731 
















TTC GCC TAT 


ATC 


TGC 


ATG 


GCC 


TAC 


CTC 


GTC 


TCC TAC GGC 


ACC 


ACC 


GTC 


GCG 


GGG 


ATC 


AGC 


F A V 


I 


C 


M 


A 


V 


h 


V 


S Y G 


T 


T 


V 


A 


G 


T 




2221/741 
















2251/751 
















CGC ACC TTC 


GCC 


CTG 


bcc 


GGA 


GTA 


TTC 


GTC 


GCC GGC ATC 


GTC 


GCC 


GTC 


CTC 


CTC 


TAC 


CTC 


R T 


A 


L 


A 


G 


V 




V 


AGT 


V 


A 


V 


L 


L 


Y 


L 



2281/761 2311/771 



THIS PAGE BUNK (us™, 




GTG TTC GGC 


GCT 


CTG 


TCC 


GAC 


ACT 


TTC 


GGC 


CGC AAG ACC 


ATG 


TAC 


CTG 


CTC 


GGC 


GCC 


GCC 


V F G 


A 


L 


s 


D 


T 


F 


G 


R K T 


M 


Y 


L 


L. 


G 


A 


A 


2341/781 
















2371/791 
















GCG ATG GGT 


GTG 


GTG 


ATC 


GCC 


CCC 


GCC 


TTC 


GCA CTG ATC 


AAC 


ACC 


GGC 


AAC 


CCG 


TGG 


CTG 


A M G 


V 


V 


I 


A 


P 


A 


F 


A L I 


N 


T 


G 


N 


P 


W 




2401/801 
















2431/811 
















TTC ATG GCC 


GCG 


CAG 


GTG 


CTG 


GTC 


TTC 


GGA 


ATT* GCA ATG 


GCC 


CCC 


GCC 


GCC 


GGC 


GTG 


ACA 


F M A 


A 


0 


V 


L 


V 


F 


G 


T A M 


A 


t> 


A 


A 


G 


Y 


T 


2461/821 
















2491/331 
















GGC-TCC CTG 


TTC 


ACG 


ATG 


GTC 


TTC 


GAC 


GCG 


GAC GTG CGC 


TAG 


AGC 


GGT 


GTC 


TCT 


ATC 


GGC 


G S L 


F 


T 


M 


V 




D 


A 


D V R 


V 


S 


G 


V 


S 


T 


G 


2S21/-841- 
















2551/351 














_i 


TAC ACC ATC 


TCC 


CAG 


GTC 


GCC 


GGC 


TCC 


GCG 


TTC GCC CCG 


ACG 


ATC 


GCG 


ACC 


GCC 


TTG 


TAC 


Y T I 


S 


0 


V 


A 


G 


S 


A 


F A P 


T 


r 


A 


T 


A 


L 


V 


2581/861 
















2611/871 
















GGC TCC ACC 


AAC 


ACC 


AGC 


AAC 


TCG 


ATC 


GTG 


ACC TAG CTG 


CTG 


ATC 


GTC 


TCG 


GCC 


ATC 


TCG 


A ?; T 


N 


T,.. ,, 


S 


N 


S 


T 


V 


T V L 




T 


V 


S 


A 


I • 


S 


2641/881 
















2671/391 
















ATC GTC TCG 


GTG 


ATC 


CTG 


CTG 


CCC 


GGC 


GGC 


TGG GGG CGC 


AAG 


GGC 


GCT 


GCG 


AGC 


CAG 


CTC 


TVS 


V 


I 


L 


L 


P 


G 


G 


W G R 


K 


G 


A 


A 


S 


0 


L 


2701/901 
















2731/911 
















ACT CGC GAC 


CAG 


GCC 


ACC 


TCC 


ACA 


CCG 


AAA 


ATG CCT GAC 


ACC 


GAA 


ACA 


TTT 


TCG 


ACT 


CGG 


T R D 


0 


A 


T 


S 


T *' 


P ■ 


K 


M P D 


T 


E 


T 


F 


s 


T 


R 


2761/921" 
















2791/931 






Monooxygenase 




AC A GTT CCG 


GAC 


ACC 


GCA 


GCA 


TCC 


CTG 


CGC 


GTC CTC GAC 


AAG 


TGA 


AGT 


GAT 


GAC 


AGA 


CAT 


T V P 


D 


T 


A 


A 


S 


L 


R 


V L D 


K 


* 


S 


D 


D 


R 


H 


Q ? R 


T 


P 


Q 


H 


P 


C 


A 


S S T 


S 


E 


Y 


_.M .. 


.T , 


D 




2321/941 














2851/951 
















GAG TGA CCA 


CGA 


CCG 


CAC 


CTC 


CTA 


CGA 


CAC 


CGA CGT CGT 


GAT 


CGT 


CGG 


CCT 


CGG 


CCC 


CGC 


SDH 


D 


R 


T 


S 


V 


D 


T 


D V V 


I 


V 


G 


L 


G 


P 


A 


2881/961 
















2911/971 
















CGG TGG CAC 


AGC 


GGC 


GCT 


TGC 


CCT 


GGC 


CAG 


CTA CGG CAT 


CCG 


CGT 


TCA 


CGC 


CGT 


CTC 


GAT 


G .. G T 


A 


A 


L 


A 




A 


S 


Y G T 


R 


Y 


H 


A 


V 


S 


M 


2941/981 
















2971/991 
















.GTT CCC CTG 


GGT 


GGC 


GAA 


CTC 


GCC 


GCG 


CGC 


GCA CAT CAC 


CAA 


CCA 


GCG 


CGC 


CGT 


.CGA 


.AG2L 


F P W 


V 


.A_. 


N 


S 


, ,P_ 


R 


A 


H T T 


N 


Q 


R 


A 


Y 


E 


V 


3001/1001 
















3031/1011 
















GCT GCG TGA 


CCT 


GGG 


CGT 


CGA 


AGA 


CGA 


GGC 


GCG CAA CTA 


CGC 


CAC 


CCC 


GTG 


GGA 


CCA 


GAT 


T. R n 


r. 


Q 


Y 


E 


D 


E 


A 


R N Y 


A 


T 


P 


W 


D 


0 . 


M 


3061/1021 






^AC 










3091/1031 
















GGG CGA CAC 


GCT 


GTT 


CAC 


GAG 


CCT 


GGC 


CGG CGA GGA 


GAT 


CGT 


CCG 


GAT 


GCA 


GAC 


CTG 


G D T 


1, 


T? 


T 


T 


S 


L 


A 


GEE 


T 


V 


R 


M 


0 


T 


W 


3121/1041 
















3151/1051 
















GGG TAG GGG 


CGA 


TAT 


CCG 


CTA 


CGG 


GGA 


CTA 


CCT GTC CGG 


AAG 


CCC 


CTG 


CAC 


GAT 


GCT 


CGA 


GTG 


D 


I 


R 


Y _ 


G 


D 


Y 


L S G 


S 


o 


C 


T 


M 




D 


3181/1061 
















3211/1071 
















CAT TCC GCA 


GCC 


CCT 


GAT 


GGA 


GCC 


GGT 


GCT 


GAT CAA GAA 


CGC 


CGC 


CGA 


ACG 


TGG 


TGC 


GGT 


T P O 




L 


M 


E 


, P. .. 


V 


T. 


T K ^ 


A 


A 


E 


R 


G 


A 


V 


3241/1081 
















3271/1091 
















CAT CAG CTT 


CAA 


CAC 


CGA 


ATA 


CCT 


CGA 


CCA 


CGC CCA GGA 


CGA 


GGA 


CGG 


GGT 


GAC 


CGT 


CCG 


T S F 


N 


T 


V 


Y 


T, 


D 


H 


A O D 




D 


G 


V 




V 


R 


3301/1101 
















3331/1111 
















GTT CCG CGA 


CGT 


CCG 


CTC 


GGG 


CAC 


CGT 


GTT 


CAC CCA GCG 


AGC 


CCG 


CTT 


CCT 


GCT 


CGG 


TTT 


F R D 


V 


R 


S 


G 


T 


V 




X Q R 


A 


R 


F 


h 


L 


G 




3361/1121 
















3391/1131 
















CGA CGG CGC 


ACG 


ATC 


GAA 


GAT 


CGC 


CGA 


ACA 


GAT CGG GCT 


TCC 


GTT 


CGA 


AGG 


TGA 


ACT 


CGC 


D G A 


R 


S 


K 


I 


A 




0 


T G r. 


o 






G 


F 




A 



3421/1141 3451/1151 



THIS PAGE BUNK (umo) 




CCG CGC CGG 


TAC 


CGC 


GTA 


CAT 


CCT 


GTT 


CAA 


CGC GGA CCT 


GAG 


CAA 


ATA 


TGT 


CGC 


TCA 


TCG 


RAG 


T 


A 


Y 


T 


L 


F 


N 


A D L 


S 


K 


V 


V 


A . 


H 


R 


3481/1161 
















3511/1171 
















GCC GAG CAT 


CTT 


GCA 


CTG 


GAT 


CGT 


CAA 


CTC 


GAA GGC CGG 


TTT 


CGG 


TGA 


GAT 


CGG 


CAT 


GGG 


P S T 




H 


W 


I 


Y 


N 


S 


K A G 


F 


G 


E 


I 


G 


M 


G 


3541/1181 
















3571/1191 
















TCT GCT GCG' CGC 


GAT 


CCG 


ACC 


GTG 


GGA 


CCA 


GTG GAT CGC 


CGG 


CTG 


GGG 


CTT 


CGA 


CAT 


GGC 


T. L R 


A 


T 


R 


P 


W 


D 


Q 


W I A 


G 


W 


G 


F 


D 


M 


A 


3601/1201 
















3631/1211 
















GAA CGG CGA 


GCC 


GGA 


TGT 


CTC 


CGA 


CGA 


CGT 


TGT CCT CGA 


ACA 


GAT 


CCG 


GAC 


CCT 


CGT 


CGG 


N G K 


P 


D 


Y 


S 


D 


D 


V 


V L ■ E 


0 


T 


R 


T 




Y 


G 


3661/1221 
















3691/1231 
















CGA CCC GCA 


CCT 


GGA 


CGT 


CGA 


GAT 


CGT 


GTC 


GAG GTC CTT 


CTG 


GTA 


CGT 


CAA 


CCG 


GCA 


GTG 


n ? H 


L... 


D 


V 


E 


T 


V 


S 


R S F 


W 


V 


' V 


N 


R 


0 


H 


3721/1241 
















3751/1251 
















GGC TGA GCA 


CTA 


CCA 


GTC 


CGG 


TCG 


AGT 


GTT 


CTG CGG CGG 


CGA 


CGC 


GGT 


GCA 


CCG 


GCA 


TCC 


a F! H 


Y 


0 


S 


G 


..,.R 


V 


c* 


CGG 


D 


A 


V 


u 


_-R_ 


H 


D 


3781/1261 
















3811/1271 
















GCC .GAG CAG 


CGG 


GCT 


GGG 


CTC 


GAA 


CAC 


GTC 


CAT GCA GGA 


CGC 


GTT . 


CAA 


CCT 


GGC 


ATG 


GAA 


P s s 


G 


L 


Q 


S 


N 


T 


S 


MOD 




r 


N 


— 


A 


W 


K 


3841/1281 , 
















3871/1291 

















GAT CGC GTT 


CGT 


CGT 


GAA 


GGG 


GTA 


TGC 


AGG 


ACC GGG TCT 


GCT 


CGA 


GTC 


CTA 


CTC 


TCC 


TGA 


T A F 


V 


V 


K 


G 


Y 


A 


G 


P G L 


L 


E 


S 


Y 


S 


P 


£ 


3901/1301 
















3931/1311 
















GCG TGT TCC 


GGT 


CGG 


CAA 


ACA 


GAT 


CGT 


CGC 


TCG CGC CAA 


CCA 


GTC 


CCG 


CAA 


GGA 


CTA 


CGC 


R V P 


V 


G 


K • 


0 ' 


I 




A 


RAN 


0 


S 


R 


K 


. D 


V 


A 


3961/1321 
















3991/1331 
















CGG GCT GCG 


CGA 


ATG 


GTT 


CGA 


TCA 


CGA 


GAG 


CGA CGA CCC 


GGT 


CGC 


CGC 


CGG 


CCT 


GGC 


AAA 


a T. R 




W 


F 


D 


H 


XT 


S 


D D P 


V 


A 


A 


G 


L 


A 


K 


4021/1341 
















4051/1351 
















GTT GAA GGA 


ACC 


CTC 


GTC 


CGA 


AGG 


TGT 


TGC 


TCT GCG TGA 


GCG 


GCT 


GTA 


CGA 


GGC 


GCT 


GGA 


L K E 




S 


S 


E 


G 


V 


A 


L R E 


R 


-._L.__ 


Y 


rr 


A 


L 




4081/1361 
















4111/1371 
















GGT GAA GAA 


CGC 


CGA 


ATT 


CAA 


CGC 


CCA 


GGG 


CGT CGA ACT 


CAA 


CCA 


GCG 


CTA 


CAC 


CTC 


GTC 


V K N 


A 


E 


■p 


N 


A 


0 


G 


V E L 


N 


0 


R 


V 


T 


S 


S 


4141/1381 
















4171/1391 
















CGC GGT CGT 


TCC 


CGA 


CCC 


CGA 


GGC 


GGG 


CGA 


GGA AGT GTG 


GGT 


GCG 


CGA 


TCG 


TGA 


GCT 


GTA 


A V V 


P 


D 


p 


E 


A 


G 


E 


E V W 


V 


R 


D 


R 


E 


•L 


Y 


4201/1401 
















4231/1411 
















CCT GCA GGC 


CAC 


CAC 


CCG 


GCC 


GGG 


CGC 


GAA 


GCT GCC GCA 


TGC 


GTG 


GCT 


GGT 


CGG 


CGC 


CGA 


r. O A 


T 


T 


R 


r> 


G 


A 


K 


L P H 


A 


W 


L 


V 


G 


A 


D 


4261/1421 
















4291/1431 
















CGG AAC CCG 


CAT 


CTC 


CAC 


CCT 


CGA 


CGT 


CAC 


CGG CAA GGG 


AAT 


GAT 


GAC 


CCT 


GCT 


GAC 


CGG 


G T R 


.1 


S 


T 


L 


D 


V 


T 


a K G 


M 


M 


T 


L 


L 


T 


G-. 


4321/1441 
















4351/1451 
















ACT CGG CGG 


CCA 


GGC 


ATG 


GAA 


GCG 


TGC 


CGC 


CGC CAA ACT 


CGA 


CCT 


GCC 


GTT 


CCT 


GCG 


GAC 


L G G 


0 


A 


W 


K 


K 


A 


A 


A K T. 


D 


L 


P 


F 


L 


R 


T 


4381/1461 
















4411/1471 
















CGT CGT TGT 


CGG 


CGA 


ACC 


CGG 


CAC 


CAT 


CGA 


CCC TTA CGG 


ATA 


CTG 


GCG 


GCG 


GGT 


CCG 


CGA 


V V V 


G 


E 




G 


T 


T 


D 


F Y G 


Y 


H 


R 


R 


V 


R 


D 


4441/1481 
















4471/1491 
















CAT CGA CGA 


GGC 


CGG 


CGC 


CCT 


GCT 


CGT 


GCG 


GCC CGA CGG 


CTA 


CGT 


CGC 


GTG 


GCG 


ACA 


CAG 


TDK 


A 


G 


A 


T, 


L 


V 


R 


P D G 


Y 


V 


A 


W 


R 


H 


S 


4501/1501 
















4531/1511 
















TGC TCC GGT 


CTG 


GGA 


CGA 


CAC 


CGA 


AGC 


GCT 


CAC CAG CCT 


CGA 


GAA 


CGC 


TCT 


CAC 


CGC 


GGT 


.A ? - V 


W 


D 


D 


T 




A 


L 


T S L 


E 


N 


A 


r. 


T 


A 


V 


4561/1521 
















4591/1531 
















CCT CGA CCA 


CTC 


GGC 


CAG 


CGA 


CAA 


CGG 


GAA 


CCC GAG CGG 


CAC 


AAA 


CGA 


GCC 


GCA 


GTA 


CAG 



THIS PAGE BlAm mm) 




L D _H — 




A 


s 


D 


N 


G 


N 


P S G 


T 


M 






— U — 


Y 


S 


4621/1541 
















4651/1551 
















^/^/^ f^t^f^ 
CAC CCG GGC 


CGT 




GAT 


CGT 


CGT 


TCC 


GCA 


CGT TAC CGC 


CGA 


CIT' 21 






A^V- 


AGC 


TTC 


T R A 


V 


P 


X 


V 


V 


P 


H 


VTA 


E 


r\ 

U 


A 


A 


■a 


A 


S 


















• HMSH 














46ol/ i3ol 
















4711/1571 
















CGC • CAC eCva 


CAC' 


CAC 


CAC 


AGT' 


CGA 


GGG 


AGA 


v_ v_ /\ 1. Vj 










Av— V— 


•AGO- G?e 


RHP 


H 


H 


H 


S 


R 


G 


R 


PPM 

c IT n 




R 


p 


V 


T 


c 


V 


A T R 


T 


T 


T 


V 


E 


G 


E 


N R • 




V 


T 


* 


o 

IT 


A 


o 


4 741/1581 
















4771 / I SQl 

*t / / 1. / X.J J X. 
















TGG GAC GAG 


CTG 


AAC 


CAG 


GTC 


GAG 


TTC 


AGC 






CAG 


GCC 


GvjC 


CCC 


TAC 




W D D 


L 


N 




Y 


£ 


F 




Q G r 




V 


A 








R 


4801/1601 
































ACC CGA TAG 


CTG 


CAC 


GCC 


GGC 


GAT 


TCG 


TCC 


AAvj t-L.t- 




ATC 


CTG 


CTG 


CAC 


GGC 


ATC 


TRY 


L. 


H 


A 


G 


D 


s 


S 


KPT 












G 


T 


4861/1621 
















4oyi / ibJi. 
















ACC GGC CAC 


GCC 


GAG 


GCG 


TAC 


GTG 


CGC 


AAT 


CTG CGC TCG 


CAT 


TCC 


GAG 


CAC 


TTC 


AAC 


GTC 


T G H 


A 


E 


A 


V 


V 


R 


N 


r. R S 


H 


S 






_i 


N 


V 


4921/1641 
















4951/ib5i 
















TGG GCA ATC 


GAC 


TTC 


ATC 


GGC 


CAC 


GGC 


TAT 


TCG ACC AAG 


CCC 


GAC 


CAC 


CCG 


CTC 


GAG 


ATC 


W A I 


D 


F 


I 


G 


H 


G 




S T K 




D 


H 




L 


E 


X 


4981/1661 
















50 11/157 1 
















AAG CAC TAG 


ATC 


GAC 


CAC 


GTG 


CTG 


CAG 


TTG 


CTG GAC GCC 


ATC 


GGC 


GTC 


GAG 


AAG 


GCC 


TCG 


K H Y 


T 


D 


H 


V 


T. 


Q 




L D A 


I 


G 


V 


E 


K 


A 


S 


5041/1681 
















507 1/169 1 
















TTT TCC GGG 


GAG 


TCT 


CTC 


GGC 


GGT 


TGG 


GTC 


ACC GCC CAG 


TTC 


GCG 


CAC 


GAC 


CAT 


CCC 


GAG 


F S G 


■p 


S 


L 


G 


G 


W 


V 


T A Q 




A 


H 


D 


K 


p 




5101/1701 
















5i->i / i / ii 
















AAG GTC GAC 


CGG 


ATC 


GTG 


CTC 


AAC 


ACC 


ATG 


f^j^^ jrmf^J^ -..f-tf^ 

GGC GGC ACC 




GCC 


AAC 


CCT 


CAG 


GTG 


ATG 


K V. D 


R. , 


•I 


V 


L. 


N 


T 


M 


G. . G T 


M 


A 


N 


? 




V ■ . 


^ — '- 


5161/1721 
































GAA CGT CTC 


TAT 


ACC 


CTG 


TCG 


ATG 


GAA 


GCG 


GCG AAG GAC 


CCG 


AGC 


TGG 


GAA 


CGC 


GTC 


AAA 


t:: R L 


Y 


T 


L 


S 


M 


v 


A 


A K D 


P 


s 


W 


E 


R 


V 


K 


5221/1741 
















5251/1751 
















GCA CGC CTC 


GAA 


TGG 


CTC 


ATG 


GCC 


GAC 


CCG 


ACC ATG GTC 


ACC 


GAC 


GAC 


CTG 


ATC 


CGC 


ACC 


A R L 


E 


W 


> 


M 


A 


D 




T M V 


T 


D 


D 


L 




P 


T 


5281/1761 
















5311/1771 
















CGC CAG GCC 


ATC 


TTC 


CAG 


CAG 


CCG 


GAT 


TGG 


CTC AAG GCC 


TGC 


GAG 


ATG 


AAC 


ATG 


GCA 


CTG 


R 0 A 


T 


F 


. .0 


.0 


P 


D 


W 


L K A 


C 


E 


_M 


N 


M 


..A 




5341/1781 
















5371/1791 
















CAG GAC CTC 


GAA 


ACC 


CGC 


AAG 


CGG 


AAC 


ATG 


ATC ACC GAC 


GCC 


ACT 


CTC 


AAC 


GGC 


ATC 


ACG 


O D T. 


E 


T 


R 


K 


R 


N 


M 


T T D 


A 


T 


I. 


N 


G 




T 


5401/1801 
















5431/1811 
















GTG CCC GCG 


ATG 


GTG 


CTG 


TGG 


ACC 


ACC 


AAG 


GAC CCC TCC 


GGT 


CCG 


GTC 


GAC 


GAA 


GCC 


AAG 


V P A 


M 


V 


L 


W 


T 


T 


K 


DPS 


G 


P 


V 


D 


E 


A 


K 


5461/1821 
















54 91/1831 
















CGC ATC GCC 


TCC 


CAC 


ATC 


CCG 


GGC 


GCC 


AAG 


CTG GCC ATC 


ATG 


GAG 


AAC 


TGT 


GGC 


CAC 


TGG 


R I A 


s 


H 


T 




G 


A 


K 


T. A T 


M 




N 


C 




H 


H 


5521/184 1 
















5551/1851 
















CCC CAG TAC 


GAG 


GAC 


CCC 


GAG 


ACC 


TTC 


AAC 


AAG CTG CAT 


CTG 


GAC 








GGT 


CGC 


P Q Y 




D 




E 


T 


F 


N 


K L H 


L 


D 








G 


R 


5581/1861 
















5611/1871 
















AGC TGA CAC 


AGA 


CCC 


CGG 


CCG 


GTG 


CCG 


CCA 


ACC CCT GCA 


ACC 


CGG 


GCG 


GCA 


CCG 


GCC 


GGA 


S * H " 


R 


p 


R 


P 


V 


p 


? 


T ? A 


T 


R 


A 


A 


p 


A 


G 


A D T 


D 


p 


G 


R 


C 


R 


Q 


P L Q 


P 


G 


R 


H 


R 


P 


D 


L T Q 


T 


P 


A 


G 


A 


A 


N 


P C N 


P 


G 


G 


T 


G 


R 


X 


5641/1881 
















5671/1891 
















TCT CAC TTA 


CCC 


GAC 


CTA 


TTG 


CGC 


TCT 


CGT 


CCG GAC CCC 


CGG 


AGA 


GAA 


AGC 


GCC 


GAA 


GCA 
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Catechol 2 , 3-dioxygenase 
5701/1901 5731/1911 



GCA GCA AGG 
S K E 


AGA 
T 


CCG 
A 


CCG 
A 


CGA 
M 


TGC 

P 


CTG 
V 


TAG 
A 


CGC TGT GCG 

-L C A 


CGA 
M 


TGT 
S 


CGC 
H 


ACT 
S 


CCC 
P 


CCC 

L 


TGA 
M. 


5761/1921 
TGG GAC GCA 
a R N 


ACG 
D 


ACC 
P 


CCG 
E 


AAC 
0 


AGG 
E 


AAG 
V 


TCA 
X 


5791/1931 
TCG ACG CCG 
n A V 


TCG 

D 


ACG 

A 


CCG 

- A 


CAT 


TCG 
D 


ACC 

H 


ACG 

A 


5821/1941 
CGC GCC GGT 
R R F 


TCG 
V 


TCG 
A 


CCG 
D 


ACT 
F 


TCG 
A 


CCC 
P 


CCG 
D 


5851/1951 
ATC TCA TCG 

I, T V 


TCA 

T 


TCT 


TCG 
A 


CCC 
p 


CCG 
D 


ACC 

H 


ACT 

V 


5881/1961 
ACA ACG GCG 

n~'"g V 


TCT 

F 


TCT 

Y 


ACG 

D 


ACC 

T. 


TGC 

L 


TGC 
P 


CGC 
o 


5911/1971 
CGT TCT GTA 

F C T 


TCG 

G 


GTG 

A 


CCG 

A 


CCG 
A 


CGC 
0 


AGT 
S 


CCG 

V 


5941/1981 
TCG GCG ACT 
G D Y 


ACG 
G 


GCA 
T 


CCG 
c* 


AAG 

A 


CCG 
G 


GCC 
o 


CTC 
L 


5.971/1991 
TCG ACG TCG 
DVD 


ACC 
R 


GTG 
D 


ACG 
A 


CCG 
A 


CCT 

V 


ACG 

A 


CAG 
V 


6001/2001 
TCG CCC GCG 
A R D 


ACG 

V 


TCC 
L 


TCG 

D 


ACA 
S 


GCG 
G 


GCA 

I 


TCG 
D 


6031/2011 
ACG TCG CAT 

V A r 


TCT 

S 


CCG 


AAC 


GCA 

M 


TGC 

« 


ACG 

V 


TCG 

D 


6061/2021 
ACC ACG GAT 
H G F 


TCG 
A 


CCC 
.0 


AAG 
A 


CAC 

T. 


TCC 

0 


AAT 


TGC 
T. 


6091/2031 
TGG TCG GAT 
V G S 


CGA 

T 


TCA 
T 


CCG 
A 


CCG 
V 


TGC 
P 


CGA 
T 


CCG 
V 


6121/2041 
TGC CGA TCT 
P T F 


TCA 
I 


TCA 
N 


ATT 

S 


CGG 

V 


TCG 
A 


CCG 
E 


AAC 
P 


6151/2051 
CGC TCG GCC 

L C, P 


CGG 
Y 


TCA 
S 


GCC 
R 


GGG 
Y 


TAG 
R 


GGC 

L 


TGC 


6181/2061 
TCG GCG AGG 

G E A 


CGG 
V 


TCG 
G 


GGC 

R 


GGG 

A 


CCG 
A 


CTG 

A 


CCA 
K 


6211/2071 
AGC TGG ACA 

L D K 


AGC 


GTG 

V 


TGC 

L 


TGT 


TCG 
V 


TCG 

G 


GAT 

s 


6241/2081 
CCG GCG GCC 
G G L 


TGT 
S 


CCC 
H 


ACG 
D 


ACC 
P 


CGC 


CGG 
V 


TCC 
P 


6271/2091 
CGC AGT TCG 
0 F A 


CCA 

T 


CCG 

A. 


CGC 
P 


CAG 


AGG 
E 


AAG 
V 


TGC 
R 


6301/2101 
GCG AGC GGT 
E R L 


TGA 
I 


TCG 
D 


ACG 
G 


GCC 
R 


GCA 
N 


ATC 
P 


CCA 

s 


6331/2111 
GTG CCG CCG 
A A • E 


AAC 
R 


GTG 
D 


ATG 
A 


CCC 
R 


GCG 
E 


AAC 
0 


AGC 
R 


6361/2121 
GCG "TCA TCA 
V T T 


CCG 
A 


CCG 
G 


GGC 
R 


GGG 
D 


ACT 


TCG 
A 


CCG 
A 


6391/2131 
CCG GCA CCG 
GTA 


CCG 
A 


CCA 

T 


TCC 
0 


AGC 
P 


CAC 

r. 


TGA 
N 


ACC 


642 172 141 
CCG AAT GGG 
E W D 


ACC 
R 


GGC 
H 


ACC 


TGC 

I. 


TCG 

D 


ACG 
V 


TCC 
h 


6451/2151 
TCG CCT CCG 
A S G 


GCG 
D 


ACC 

L 


TCG 

E 


AGC 
Q 


AGA 
I 


TCG 
D 


ACG 
A 


6481/2161 
CGT GGA CCA 
W T N 


ACG 
D 


ACT 
W 


GGT 

^. F 


TCG 
V 


TCG 
E 


AAC 
0 


AGG 
A 


6511/2171 
CCG GAC ACT 
G H S 


CCT 
S 


CCC 
H 


ACG 

E 


AAG 
V 


TGC 
R 


GCA 

'T 


CCT 
W 


6541/2181 
GGA TCG CCG 
X A A 


CGT 

Y 


ACG 

A 


CGG 

A 


CAA 

M 


TGA 
S 


GCG 

A 


CCG 
A 


6571/2191 
CCG GGA AGT 

G K Y 


ACC 
R 


GCG 
Y 


TCA 

T 


CCT 
S 


CGA 
T 


CCT 


TCT 

V 


6601/2201 
ACC GCG AAA 
RET 


TCC 
H 


ACG 

s 


AGT 
W 


GGA 

T 


TAG 
A 


CAG 
G 


GAT 

F 


6631/2211 
TCG GGA TTA 

GIT 


CTA 

T 


CCG 

A 


CCG 
V 


TCG 
A 


CCG 
V 


TCG 
D 


ACG 
S 


6661/2221 
AAT AGA CCC 
* T ? 


CGC 
P 


CGC 
L 


TCC 
p 


CGC 
P 


CCC 
R 


GCA 
S 


GTC 
p 


6691/2231 
CCA ACG AAG 
N E G 


GGT 
W 


Alcohol dehydrogenase 
GGC CCC GGA TGA CCT CCG 
P R M T S V 


6721/2241 
TCC GCC CGT 
R P C 


GCT 
S 


CGC 

T5 


CGT 
S 


CGG 
V 


TGA 
M 


ACG 
A 


CGG 
G 


6751/2251 
GCT GGT CGG 
W S V 


TGG 
G 


GCA 
R 


GGA 

K 


AGA 

T 


CCT 

S 


CAT 
S 


CGC 
o 


6781/2261 
CGA CAT CGC 


CCT 


CGA 


CCT 


CGC 


AGC 


TCG 


TCA 


6811/2271 
GTA GGA ATG 


CGC 


ACG 


GGC 


CGA 


CGA 


GTC 


GCG 
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T ^ P 




T 


s 


w~ 




\f 




R N A 








i 


Q 




A 


684 1/2281 
















6871 /P7Q1 
















CTG GTC ACC 


GGG 


GCC 


AGC 


CGC 


GGC 


ATC 


GGG 


GCG GCC ATC 


GCA 


GAT 


GCG 


GTG 


GCC 


GCC 


TCC 




Q 


n 


p 


p 

n_ 


u 


.. .Jtv , 


.... 


G H R 










p 


T 


S 


con 1 /7 10 1 
































GGT GCC GCC 


GTA 


ATC 


GTC 


CAC 


TAC 


GGA 


TCC 


GAT CGG ACG 


GCC 


GCC 


GCT 


GCG 


GTG 


TCG 


ACG 


O D 

U it r\ 


N 




-f 


L 


R 


X 


R 


c r\ 

^ D Lj 


R 


R 


c 


G 


V 


D 


G 


















oyyi/ZJJl 
















GCA TCA CGG 


CTG 


CCG 


GGG 


GCC 


TCG 


CGG 


CTG 


CGG TCC AGG 


CCG 


ACC 


TGT 


CCC 


GAC 


CCG 


AGG 


I . T - A 


A 


G 


G 




A 


A 


A 


V • Q A 






S 


R 


■P 


... .-g.. 




7021/ zJ41 
















7051/2351 
















GGC CTG AAG 


AGC 


TGA 


TGC 


GGG 


AGT 


TCG 


ACT 


CCG CGC TCG 


ACG 


GTC 


TCG 


GGC 


TCG 


ACC 


GAG 






M 


R 






D 


S 


A L D 


G 




G 




D 


R 


G 


/Ool/2Joi 
















7111/2371 
















GGC TCG ACA 


TCC 


TCG 


^CA 


ACA 


ACG 


CCG 


GAA 


TCA GTC GGC 


GCG 


GAG 


CGC 


TCG 


AGC 


GCG 


TCA 


L D I 




V 


N 


N 


A 


G 


I 


S ?. 


G 


A 






R 


V 


T 


/ 14 l/ZJOi 
















"71 "71 /TIQl 

/i/i/2jyi 
















CTG TCG AGG 


ATT 


TCG 


ACC 


GTC 


TGG 


TCG 


CAC 


TCA ACC AGC 


GCG 


CCC 


CGT 


TCT 


TCG 


TGA 


CTC 


V E D 




D 




Li 


V 


A 


Ii 


NOR 


A 








V 


T 


R 


/zUi/ ZHUi 
















/2Jl/z4ii 
















GGC ATG CCC 


TGC 


CCC 


GGA 


TGC 


ACG 


ACG 


GCG 


GTC GCA TCG 


TCA 


ACA 


TTT 


CCT 


CCG 


GAT 


CCG 


HAL. 




R 


M 


H 


D 


G 


G 


R I V 


N 


X 


S 


S 


G 


S 


A 


















/zyi/Z4ji 
















CCC GCT ACG 


CCA 


GAC 


CCG 


ACG 


TCA 


TCA 


GCT 


ACG CCA TGA 


CCA 


AGG 


GGG 


CGA 


TCG 


AGG 


TGC 


R y A 


K 




D 


V 


T 


S 


Y 


A M T 




G 


A 






V 




















7jbi/Z451 
















TCA CCC GCG 


CCC 


TCG 


CCG 


TAG 


ACG 


TCG 


GCG 


AAC GAG GCA 


TCA 


CCG 


CCA 


ACG 


CCG 


TGG 


CGC 


T R A 





A 


V 


D 


V 


G 




R G X 






M 


A 


V 


A 




/Jol/ zlbi 
















/4ii/Z4 / i 
















CGG CCG CGC 


TCG 


ATA 


CCG 


ACA 


TGA 


ACG 


CGC 


ACT GGC TTC 


GCG 


GTG 


ACG 


ACC 


ATG 


CCC 


GCA 


A A L 


D 


T 


D 


M 


N 


A 


H 


W L R 


G 


D 


D 


H 


A 


R 


T 


7441/2481 
















7471/2491 
















CCA CCG. CCG 


CGT 


CCA 


CCA 


CTG 


CAC 


TGC 


GAA 


AAC .TCG CCA 


CCG 


CGG 


AGG 


ACA 


TCG 


.CCG.. CGA._ 


T A A 


S 


T 


T 


A 




R 


K 


L A T 


A 




D 


X 


A 


A 


T 


7501/2501 
















7531/2511 
















TCG TGG CCT 


TCC 


TCG 


TCA 


GCG 


CCG 


CCG 


CCG 


GTG CGA TCA 


CCG 


GGC 


AGG 


TCA 


TCG 


ACG 


CCA 


V A F 




V 


S 


A 


A 


A 


G 


AIT 


G 


Q 


V 


I 


D 


A 


T 


7561/2521 
















7591/2531 
















CCA ACG GCA 


ACC 


GGC 


TCT 


AAC 


CAG 


AAC 


TTA 


CCC GGT CCC 
















NGN 


R 




* 


P 


E 


L 


T 


R S 
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Jacquiau & J. Archer 



Plasmid name: p^*^ 
Plasmid size: 8.51 kfc 
Constructed by: J. Poweir 
Construction date: 6.6.97 

Comments/References: mycolic acid bacterium - Escherichia coli mobilizabie 
cosmid vector. Carries pCYl04 replionS. Kanamycin resistant 15 jig/ml mycolic acid 
bacteria; 50 |Ag/mi Escherichia coli. CarriesM^mbda cos site. RP4 oriT site. Multiple 
cloning site. 
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c 

1 I C-^ - 



G 



Patenf^pecification^GTERi^lx^ 




pBR322 oriV 



RP4 oriT 



XbaV 5.42 



Plasmid name: pJP7 
Plasmid size: 10^6kb 
Constructed by: J. P 
Construction 



Archer 



owell & J. 
date: 9/2>~9 7 

Comments/References: MobHizable E.coii/Rhodococcus/Nccardia suicide/ 

luciferase integratio>si(ector encoding lu^B signal genes, 
sacB ge>e.and thiostreppjon resistance in^odococcus / 
Nocardia only^up to 75 |ig/ml typically l-loVo/ml used in 



Nocardia onty up lo n-y^r LyK.v-«..j . - 
selections. RF4<RK2 mobilizabl^. By cloning dnegion of 
homoloqv into th^region upstream of the luxAB cassette 



homology into the"^ region upstrear 
insertoin can be targeted. 



n4 iz_ 



rigufc 3 



1 
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